Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing

نویسندگان

Kyoko Matsuyama

Kazunori Komatani

Toru Takahashi

Tetsuya Ogata

Hiroshi G. Okuno

چکیده

We describe a novel dialogue strategy enabling robust interaction under noisy environments where automatic speech recognition (ASR) results are not necessarily reliable. We have developed a method that exploits utterance timing together with ASR results to interpret user intention, that is, to identify one item that a user wants to indicate from system enumeration. The timing of utterances containing referential expressions is approximated by Gamma distribution, which is integrated with ASR results by expressing both of them as probabilities. In this paper, we improve the identification accuracy by extending the method. First, we enable interpretation of utterances including ordinal numbers, which appear several times in our data collected from users. Then we use proper acoustic models and parameters, improving the identification accuracy by 4.0% in total. We also show that Latent Semantic Mapping (LSM) enables more expressions to be handled in our framework.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing user utterances in barge-in-able spoken dialogue system for improving identification accuracy

In our barge-in-able spoken dialogue system, the user’s behaviors such as barge-in timing and utterance expressions vary according to his/her characteristics and situations. The system adapts to the behaviors by modeling them. We analyzed 1584 utterances collected by our systems of quiz and news-listing tasks and showed that ratio of using referential expressions depends on individual users and...

متن کامل

Analyzing temporal transition of real user's behaviors in a spoken dialogue system

Managing various behaviors of real users is indispensable for spoken dialogue systems to operate adequately in real environments. We have analyzed various users’ behaviors using data collected over 34 months from the Kyoto City Bus Information System. We focused on “barge-in” and added barge-in rates to our analysis. Temporal transitions of users’ behaviors, such as automatic speech recognition...

متن کامل

Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User

Modeling of individual users is a promising way of improving the performance of spoken dialogue systems deployed for the general public and utilized repeatedly. We define “implicitly-supervised” ASR accuracy per user on the basis of responses following the system’s explicit confirmations. We combine the estimated ASR accuracy with the user’s barge-in rate, which represents how well the user is ...

متن کامل

Enabling a user to specify an item at any time during system enumeration - item identification for barge-in-able conversational dialogue systems

In conversational dialogue systems, users prefer to speak at any time and to use natural expressions. We have developed an Independent Component Analysis (ICA) based semi-blind source separation method, which allows users to barge-in over system utterances at any time. We created a novel method from timing information derived from barge-in utterances to identify one item that a user indicates d...

متن کامل

Handling rich turn-taking in spoken dialogue systems

This paper discusses how to build a system that can engage in a mixed-initiative human-machine spoken dialogue in which system utterances sometimes overlap with user utterances and vice versa. In the method, a module that incrementally understands user utterances and another module that incrementally generates system utterances work in parallel, and the timing of taking and releasing the dialog...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing

نویسندگان

چکیده

منابع مشابه

Analyzing user utterances in barge-in-able spoken dialogue system for improving identification accuracy

Analyzing temporal transition of real user's behaviors in a spoken dialogue system

Predicting Barge-in Utterance Errors by using Implicitly-Supervised ASR Accuracy and Barge-in Rate per User

Enabling a user to specify an item at any time during system enumeration - item identification for barge-in-able conversational dialogue systems

Handling rich turn-taking in spoken dialogue systems

عنوان ژورنال:

اشتراک گذاری